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Abstract 


In 2003, Cohn and Umans proposed a group-theoretic approach to bounding the exponent of 
matrix multiplication. Previous work within this approach ruled out certain families of groups as 
a route to obtaining w = 2, while other families of groups remain potentially viable. In this paper 
we turn our attention to matrix groups, whose usefulness within this framework was relatively 
unexplored. 

We first show that groups of Lie type cannot prove w = 2 within the group-theoretic approach. 
This is based on a representation-theoretic argument that identifies the second-smallest dimension 
of an irreducible representation of a group as a key parameter that determines its viability in 
this framework. Our proof builds on Gowers’ result concerning product-free sets in quasirandom 
groups. We then give another barrier that rules out certain natural matrix group constructions 
that make use of subgroups that are far from being self-normalizing. 

Our barrier results leave open several natural paths to obtain w = 2 via matrix groups. To 
explore these routes we propose working in the continuous setting of Lie groups, in which we 
develop an analogous theory. Obtaining the analogue of w = 2 in this potentially easier setting 
is a key challenge that represents an intermediate goal short of actually proving w = 2. We give 
two constructions in the continuous setting, each of which evades one of our two barriers. 
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1 Introduction 


The exponent of matrix multiplication is the smallest number w such that for each £ > 0, there 
exists an algorithm for multiplying two n x n matrices using O(n“’T*) field operations. It is clear 
that w > 2, and a long line of work has led to the best upper bound currently known of w < 2.37286 
from [AW21]. It is a longstanding and well-known open problem to resolve the conjecture that 
w= 2. 

In [CU03] a group-theoretic approach to bounding w was proposed. Given any finite group G 
and three subsets of G satisfying a certain condition (the triple product property), this approach 
yields an upper bound on w by reducing an instance of matrix multiplication to multiplication 
in the group algebra of G. This approach can capture the Coppersmith—Winograd family of 
algorithms [CKSU05], which includes the current record bound. While some barriers are known for 
the group-theoretic approach, for instance that abelian groups cannot not prove w < 3 (see [CU03, 
Lemma 3.1]) or that certain generalizations of the constructions in [CKSU05] cannot achieve w = 2 
[BCC*17a, BCC*17b, Saw18], the possibility that one could show w = 2 using a suitable family of 
nonabelian groups remains wide open.! 

Previous work on the group-theoretic approach has focused mainly on families of very non-simple 
groups, i.e., groups built up from simple groups through repeated group extension. For example, 
the best bounds known on w can be obtained using a semidirect product of the symmetric groups 
with direct products of abelian groups. At the same time, the aforementioned barriers rule out 
these kinds of constructions and certain generalizations [BCCT17a, BCCt17b, Saw18]. But this 
has left the simple groups—in some sense the opposite end of the spectrum of finite groups—largely 
unexplored. 

In this paper we address this gap in knowledge by studying finite groups of Lie type? in the 
framework of [CU03]. This is an important class of groups that contains all of the finite simple 
groups except alternating or cyclic groups and finitely many sporadic groups. Some good examples 
to keep in mind are the classical matrix groups such as the group SL(n, q) of determinant 1 matrices 
over the finite field F, or the group of n x n orthogonal matrices over Fy with respect to a quadratic 
form. 


1.1 Results 


We start by showing that triple product property constructions (see Definition 2.1) in groups of Lie 
type cannot prove any bound on w better than 2+ for some absolute constant ¢ > 0 (Corollary 3.4). 
This resolves a question asked in [CU03]. Our proof combines a representation-theoretic argument 
with known bounds on the dimension and number of irreducible representations in groups of Lie 
type [LS74, FG12]. More broadly, we identify the second-smallest dimension of an irreducible 
representation as a key parameter of a group that determines its viability for the approach of 
[CU03]: Theorem 3.2 shows that groups where this quantity is large cannot yield good bounds on 
w. For example, any family of groups for which the second-smallest dimension of an irreducible 


‘Other barrier results rule out different generalizations of the Coppersmith—Winograd approach, such as [AFLG15, 
AW18, Alm21, CVZ21], but it remains unclear the extent of the implications those barriers have for the group-theoretic 
approach. 

Specifically, by a group of Lie type we mean any one of the (possibly twisted) Chevalley groups, including the 
Suzuki and Ree groups, or the quotient of such a group by its center. A Chevalley group is the fixed points of a 
Steinberg endomorphism in a semisimple algebraic group over a finite field (see Definition 21.6 and Table 22.1 in 
[MT11]). Among others, this list includes SL(n, q), SU(n, q), SO(2n + 1,q), Sp(2n, q), SOT (2n, q), and SOT (2n, q). 
Obtaining simple groups can require taking the quotient by the center, but that does not change our conclusions, such 
as Corollary 3.4. 


representation grows as a power of the size of the group cannot yield w = 2. It had been known 
since [CU03] that the largest dimension played a key role in the quality of the bound, but small 
dimensions were not previously understood to be relevant. 

This first barrier builds on Gowers’ theorem on product-free sets in quasirandom groups [Gow08]. 
We note that whereas Gowers’ result involves the minimum dimension of a nontrivial representation, 
the additional structure of our problem allows us to consider the second-smallest dimension of an 
irreducible representation (in other words, we can skip any other representations of dimension 1). 
This gives us lower bounds in groups where Gowers’ result does not apply, such as SL(n, q). It is 
interesting that while triple product property constructions in abelian groups cannot yield nontrivial 
bounds on w, this barrier shows that highly nonabelian groups also have significant limitations. 

Next we show in Theorem 3.6 that subgroups with large normalizers cannot be used in a triple 
product property construction to obtain w = 2. This barrier is particularly effective in the setting 
of matrix groups. For example, one cannot obtain w = 2 via a triple product property construction 
using three subgroups inside GL(n, q) for varying q and any fixed n, or even inside products of such 
groups. 

Our first barrier result rules out obtaining exponent 2 from finite groups of Lie type, but still 
leaves open the possibility that such groups could serve as building blocks in efficient algorithms for 
matrix multiplication. For example, the direct product of such groups escapes the barrier entirely, 
since the second-smallest dimension of an irreducible representation of a direct product equals 
the second-smallest dimension among the irreducible representations of the factors. Similarly, our 
normalizer barrier suggests that constructions should aim to use subgroups that are self-normalizing. 
We therefore view our barriers as giving us useful information about what a possible construction 
using finite groups of Lie type must look like, if it is to give w = 2. 

In the second part of the paper, we give constructions that naturally use a direct product in a 
critical way (Theorem 4.9), and constructions that use self-normalizing subgroups (Theorem 4.10). 
It is important to note that we know of no constructions in finite groups of Lie type that even meet 
a certain “packing bound” (Definition 2.3), a prerequisite for obtaining w = 2. This remains an 
important challenge. In lieu of such constructions, we direct our efforts at obtaining constructions in 
continuous Lie groups, which seems easier and mathematically cleaner to work in, and where we can 
ask direct analogues of the main questions. Here we have constructions that meet the packing bound. 
Moreover, we give examples that use direct products and self-normalizing subgroups, desiderata by 
the barrier results. 

Our constructions do not achieve the Lie analogue of beating exponent 3, and we suggest 
improving them and finding other examples as key challenges highlighted by this work. 


1.2 Outline 


In the next section we review the group-theoretic approach to bounding w. In Section 3 we give our 
barriers. We then introduce the Lie exponent in Section 4 and give our constructions. We conclude 
with some questions in Section 5. 


2 Background 


In this section we review the approach of [CU03] to bounding w. For a finite group G, let Irr(G) 
denote the set of equivalence classes of irreducible complex representations of G, and for X C G, let 
Q(X) = {xxt : 2,2’ € X} be the quotient set of X and X~! = {x71 : x € X} be its collection of 
inverses. 


Definition 2.1. [CU03, Definition 2.1] We say that subsets S, T, and U of a group G satisfy the 
triple product property if for all s € Q(S), t € Q(T), and u € Q(U), 


stu = 1 s=t=u=l1. 


The simplest case is when the three subsets of G are subgroups, in which case we do not need to 
take their quotient sets since Q(H) = H for every subgroup H. However, the known constructions 
that achieve nontrivial upper bounds for w do not use subgroups [CKSUO05]. 

Given three subsets of G that satisfy the triple product property, [CU03] shows how to reduce a 
matrix multiplication problem to convolution of functions on G (in other words, multiplication in 
the group algebra C/G]). This reduction yields the following inequality for the exponent of matrix 
multiplication w: 


Theorem 2.2. [CU03, Theorem 4.1] If S, T, and U satisfy the triple product property in G, then 


(IS||ITI|U)? < So ap, 


where d; E€ N are the dimensions of the irreducible representations of G. 


If |S], |T|, and |U| are large and the dimensions d; are not too large, then this inequality yields 
an upper bound for w. For example, if 


|S|ITI|U| > >) a, 


then it proves that w < 3. Using Theorem 2.2, it was shown in [CKSU05] that w < 2.41. In fact, 
this framework is powerful enough to capture the Coppersmith-Winograd family of algorithms, 
including the latest record of w < 2.37286 from [AW21]. 

In the notation of Theorem 2.2, we always have X`; d? > >>, d? = |G|. Thus, the inequality in 
the theorem never implies a better bound on w than (|$||T'||U|)*/3 < |G| would. This inequality is 
always consistent with w = 2, because the multiplication maps from S x T, T x U, and S x U to 
G must be injective to satisfy the triple product property, and therefore |S||T||U| < |G|?/2. This 
gives us a necessary condition for proving that w = 2 via Theorem 2.2: 


Definition 2.3. We say a sequence G1, G2,... of finite groups meets the packing bound if there 
exist subsets S;, T;, and U; of G; satisfying the triple product property such that 


[$4] |T; |Us] = |G, [8/2-e 
as i — oo. 


Several families of groups meeting the packing bound were constructed in [CU03]. In the 
terminology of [CU03], meeting the packing bound means the pseudo-exponent of G; converges to 2 
as i > oo. A simple argument [CU03, Lemma 3.1] shows that this can happen only if |G;| — oo. 

Meeting the packing bound is a necessary condition for Theorem 2.2 to yield w = 2: if a family 
of groups contains no sequence meeting the packing bound, then there is a constant £ > 0 such that 
no group in the family can prove an upper bound on w better than 2+ € via Theorem 2.2. However, 
meeting the packing bound is not in itself sufficient to achieve w = 2, or even w < 3. 


3 Barriers for matrix groups 


In this section we explain the two barriers mentioned in the introduction. Each of them is based on 
an idea that is particularly relevant for matrix groups, although we formulate the bounds in greater 
generality. 


3.1 <A representation-theoretic barrier 


We begin by proving our representation-theoretic barrier, which we then apply to groups of Lie 
type. Our proof of Theorem 3.2 follows the Fourier-analytic proof of Gowers’ theorem on mixing 
in quasirandom groups (see, for example, [Brel4, Lemma 2.2]). Our barrier is a function of the 
second-smallest dimension of an irreducible representation of G. Because we use this parameter 
frequently, we introduce notation for it: 


Definition 3.1. For a finite nonabelian group G, let n(G) = minzen(G): dimr>1 dimm be the 
smallest dimension of an irreducible representation of G of dimension greater than 1. 


Theorem 3.2. If subsets S, T, and U satisfy the triple product property in a finite nonabelian 
group G, then 

|G|3/2 
n(G)1/2 


Proof. Let 1x denote the indicator function of a subset X C G. For brevity, given X C G and 
T € Irr(G), we will write m(X) = X pex T(x) and dr := dim. 

Suppose that S,7',U satisfy the triple product property. Equivalently, the value at the identity in 
the 6-fold convolution 1g * 15-1 *1p*1p-1 *1ly *1ly-1 equals |S| |T| |U|. The Fourier inversion formula 
says that a function f: G — C can be reconstructed using the inner product (X,Y) = Tr(XY') 
(where Y denotes the entry-wise complex conjugate) as 


ozi E (Droo) 


mwElrr(G) heG 


ISITI |U] < 


+ |G]. 


Applying this formula to f = 1g * 1g-1 * lp * lp-1 * ly * ly-ı and g = 1 yields 


IGIISIITIUL= $> drTe(a(8S)r(9)r(T)a(T (U )r(U™)). 
rElrr(G) 


When d, = 1, 


m(S)r(S-1) = Y a(s) So T) = [a (S)P, 


ses ses 
which is a nonnegative real number, and 7(S)7(S~!) = |S]? if m is the trivial representation. Thus, 
IG [SITU] > (ISITI UD)? + XO dx Te(w(S)m(S~)a(L)m(L-*) aU) x(U*)) 
tT: dq>1 
= (ISIITIUN? + SY) de Te(a(S7!)x(L)n(L-")x(U)m(U-") (8). 
wt: dr>l 


By the Cauchy—Schwarz inequality, 


IGIISIITIIUI > (ISTUN? - SS dallal S TIT) a(r U) a(S), 
T: dr>l 


where || - || denotes the Frobenius norm of a matrix (i.e., |M]? = Tr(M@M')). 
Fourier inversion implies a nonabelian version of Parseval’s identity, which states that for any 


function f: G > C, 
2 


TO=- E al oro) 
iG 


gEG nElrr(G) gEG 


Applying this formula with f = 1g-1 * 17, which is equal to the indicator function of S~!T by the 
triple product property, we obtain 


ISIITIIGI= $D dlas D)’, 
rElrr(G) 


and thus for each 7 € Irr(G) with dr > 1, 


la(S D) < VISIITI IGI/n(G). 


Using this bound and the Cauchy—Schwarz inequality, we find that 


IGI ISI ITI |U] > (SITU)? — VISITIGI/n(@) $) dalia(T U) a(S) 


T: da> 
> (ISIT UD? — vVISIITIIGI/ aG), |] XO drla(T=0U) | XO  dellr(U-18)|2, 
m: dr>l m: dr>l 


and so by Parseval’s identity, 


IG] ISIITIU] > (S| ITO)? — VISIT IGl/n(G) VIG TIO] V1G1 ISIU] 
= (ISI |T||U))? — IS| |T| [U1 1GP nay”. 
We conclude that 
m 
Gy? 


[S| |U] < + IG], 


as desired. 


We immediately obtain the following corollary: 


Corollary 3.3. No sequence G1,G2,... of finite groups satisfying n(G;) > Q(\G;|°) with 6 > 0 can 
meet the packing bound. 


Corollary 3.4. There exists a constant € > 0 such that no triple product property construction in 
a group of Lie type can yield an upper bound on w better than 2 + €. 


This corollary is more subtle than the previous one, since it does not simply amount to a failure 
to meeting the packing bound. 


Proof. First, we deal with the case of groups of Lie type of bounded rank. Such groups G satisfy 
n(G) > Q(|G|°) for some constant 6 > 0, as one can check from the bounds given in [LS74], and 
this condition suffices by Corollary 3.3. 

Now let G be a group of Lie type of rank r and dimension d over F}. Then |G| = O(qf) (see, 
for example, [MT11, Table 24.1]), and the lower bound n(G) > Q(q") holds by [LS74]. Hence by 


Theorem 3.2, 
|S||Z||U| < IGI’ /./n(G) +|G| = Olga? ti), 


By [FG12, Theorem 1.1], there are O(q") conjugacy classes in G. Let d1, ..., dm be the dimensions 
of the irreducible representations of G, where m is the number of conjugacy classes of G. We have 


did = |G| = O(q’), and 


w 2\ w/2 
Dt p (Zid) 


m m 


since z œ x/? is a convex function. Hence X,d? > Q(q™**(-")/2), and therefore Theorem 2.2 
cannot yield an upper bound on w better than 


u <3 (2) 


r 


for some absolute constant C > 0. If r is large enough, then this bound cannot approach 2, and the 
case of bounded r was dealt with above. 


Note that this corollary holds not just for groups of Lie type, but also for simple groups that 
are quotients of groups of Lie type by their centers. In particular, the second part of argument 
depends on only two bounds, namely a lower bound for n(G) and an upper bound for the number 
of conjugacy classes in G. Both of these bounds are preserved by taking the quotient: a quotient 
group always has at most as many conjugacy classes as the original group, and every irreducible 
representation of the quotient yields an irreducible representation of the original group. 

Another consequence of Theorem 3.2 is a slightly sharper estimate for how close |S] |T| |U| can 
come to |G \3/ 2 when S, T, and U satisfy the triple product property in G. It follows from [CU03, 
Lemma 3.1] that |S| |T| |U| < |G|°/2, but this inequality does not rule out the possibility that |S], 
IT|, and |U| might be a large as ||G|!/? —1]. The following corollary shows that this cannot happen 
when |G| is sufficiently large. 


Corollary 3.5. If subsets S, T, and U satisfy the triple product property in a finite group G, then 
[S| IZ] |U] < |GR?/Vv2 + |G|. 


Proof. If G is abelian, then |S] |T||U| < |G| by [CU03, Lemma 3.1]. Othewise n(G) > 2 and the 
conclusion follows from Theorem 3.2. 


3.2 A barrier for subgroups that are not self-normalizing 


In contrast to the previous barrier, which follows from properties of the containing group, we now 
give a barrier in terms of the three subsets used in a triple product property construction. It will 
apply only to the case of three subgroups, as opposed to arbitrary subsets. 

For X C G, let N(X) = {g € G : gXg~! = X} denote the normalizer of X in G, and let 
Z(G) = {g € G: gh = hg for all h € G} denote the center of G. 


Theorem 3.6. Suppose that subgroups Hı, Hə, and H3 satisfy the triple product property in a finite 
group G, and let si =|N(H;)|/|Hi|.. Then 


|G|3/2 


Ay | | Ho| | H3| < ————_-. 
| 1l 2l | 3l — (s18283)1/4 


Proof. The main observation in this proof is that |H1| |N (H1) O H2||H3| < |G| (and the analogous 
inequality for any permutation of Hi, H2, and H3). To prove this inequality, we will show that the 
map 


(hı, ho, h3) E> hihehs 


is injective on Hy x (N(H1) O H2) x H3. If not, then there exist (h1, h2, ha) Æ (h4, ho, R4) for which 
hyhgh3 = hi hhh, 


which implies that 
hy tht hihehshy | = 1. 


However, hy (hT *h1)hh is another element h? € Hy (not equal to 1 if h} 4 h1), since h4 is in the 
normalizer of Hı. We thus have hi (hy ‘h2)(h3h'; +) = 1 with not all three factors equal to 1, which 
contradicts the triple product property for Hı, Hə, and H3. 

Now this inequality implies that 


|N(H1)| |H2| 


|N(H1)| |H2| 
|N (Ay) Ap| 


|G| > [H| |N (H1) A H| |.A3| = | H| iG) 


|Hs| > |H1| | 73]. 


The inequality in the theorem statement follows by repeating this argument with H2, H3 and then 
Hs, Hı in place of Hı and Hə and taking the product. 


Remark 3.7. The same proof in fact works for subsets S, T, U satisfying the triple product property, 
not just subgroups, and leads to the conclusion that 


ek 1/2 
IN(Q(S)) NTI |N(Q(T)) NU] LCE. l 
The following corollary shows that triple product property constructions using subgroups of 


groups G satisfying |Z(G)| = Q(|G|°) with 6 > 0 cannot meet the packing bound. For example, this 
shows that triples of subgroups in GL(n, q) with fixed n cannot meet the packing bound.’ 


ISIITIIUI < ( 


Corollary 3.8. If subgroups Hı, Hə, and H3 satisfy the triple product property in a finite group G, 


then 
las? 


|Z(G)|/?" 


Proof. Because Hı N Z(G), Hə N Z(G), and H3 N Z(G) satisfy the triple product property in the 
abelian group Z(G), 


| H| || | H3| < 


|Z(G)| > |H1 N Z(G)| |H2N Z(G)| |H; N Z(G)| 
by [CU03, Lemma 3.1]. Combining this inequality with Z(G) C N(H;) shows that 


|Z(G)| > |H1 N Z(G)| |H2 N Z(G)| | H3 N Z(G)| 
= |H] |H2| |H3| |Z (G)|* / (H1 Z(G)| |H2Z(G)| |H3Z(G))) 
> | H| |H2| |Ħ3| |Z(G)|*/ (H1 N(H1)| |H2N (H2)| |H3N(H3)]|) 
= |H1| |H2] |Ħ3] |Z (G) /(IN(H1)| |N (F2) |N(Ħ3)|), 


and therefore |N (H1)| |N (H2)| |N(43)|/(|H1| |H2| |H3|) > |Z(G)|?. The conclusion now follows by 
Theorem 3.6. 


3More generally, arbitrary subsets cannot meet the packing bound, because intersecting random translates of the 
subsets with SL(n, q) would give subsets of SL(n, q) meeting the packing bound in expectation, and we have seen that 
this is impossible since SL(n, q) a group of Lie type of bounded rank when n is fixed. 


4 Constructions in Lie groups 


In this section, we study triple product property constructions in Lie groups (i.e., groups that are 
also smooth manifolds). All Lie groups will be assumed to be positive-dimensional. We define the 
Lie exponent of a Lie group G in terms of the rank r(G), which we take to be the real dimension of 
a Cartan subalgebra of the Lie algebra.* 


Definition 4.1. The Lie exponent w(G) of a Lie group G of rank r(G) is the infimum of the 
quantity 
r(G) 
(dim Mı + dim Mz + dim M3)/3 — (dim G — r(G))/2 
over all submanifolds Mı, M2, and M3 of G satisfying the triple product property and (dim Mı + 
dim Mz + dim M3)/3 > (dim G — r(G))/2. (Recall that the infimum of the empty set is +00.) The 
Lie exponent of a family of groups is the infimum of w(G) over G in the family. 


We primarily have in mind semisimple Lie groups, or more generally reductive groups, and 
it is unclear how relevant the Lie exponent is for other groups. Note that if G is abelian, then 
r(G) = dimg G. 

Definition 4.1 is motivated by the following analogy with the finite field setting. The finite 
groups of Lie type fall into families of Chevalley groups defined over F, as q varies, with the families 
corresponding to the classification of simple Lie groups (as well as some complications such as 
twisting). For example, SL(n,q) is analogous to SL(n,R) or SL(n,C). Suppose we have triple 
product property constructions with subsets of sizes q7™te), qm2+0(1) and g™3+°) in such a 
family of simple groups Gy as q —> oo. This is a finite analogue of having submanifolds of dimensions 
mı, M2, and mg. It follows from [LMT13, Theorem 1.3] that the largest irreducible representation 
of G4 has dimension g(4-")/2+0) as q + œo, where d and r are the dimension and rank of the 
corresponding Lie group.” If the dimensions of the irreducible representations of Gy are di,..., dp, 
then by Theorem 2.2, 


qh tata +ol))w/3 < 3 d? 


(3 
< J d? max d?~* 
j J 
i 
= |G| max d?~? 
j J; 


= ge een ret) 


and taking the limit as q > co shows that 


w < i 
~ (mı + M2 + m3)/3 — (d — r) /2 


if the denominator is positive. In other words, Definition 4.1 is exactly the bound on w one would 
get if the construction had an analogue in the corresponding finite groups of Lie type. We note that 
we know of no general reason why such an analogue should exist; indeed, we do not know of any 
finite analogues of the Lie group constructions given later in this section. We pose the following 
question: 


“Note that this differs from the usual convention for complex Lie groups of using the complex dimension. For 
example, this is why Table 4.1 shows that r(GL(n, C)) = 2n. 

°To deduce this result from [LMT13, Theorem 1.3], note that the Steinberg representation has dimension q 
See also [LMT13, Theorems 5.1-5.3] for some classical groups that are not quite simple. 


(d-r)/2_ 


Question 4.2. Is it true that for every Lie group G, the exponent of matrix multiplication is at 
most w(G) ? 


By Proposition 4.3 below, the answer must be yes if w = 2. A direct proof would be of 
considerable interest. Even without such a proof, we view w(G) as a model for what groups can do 
in the continuous setting, which allows for geometric constructions that may not work over finite 
fields, but that we might hope give inspiration for related constructions in the finite setting. 


Proposition 4.3. Every Lie group G has w(G) > 2. 


Proof. If My, M2, and M3 satisfy the triple product property in G, then the map (m1, m2) > mī mo 
from Mı x Mə to G is injective, and so dim Mı + dim Mə < dim G. Similarly, dim Mz + dim M3 < 
dim G and dim Mı + dim M3 < dim Œ. Averaging these inequalities shows that (dim Mı + dim M2 + 
dim M3)/3 < (dim G)/2, and therefore the bound we obtain for w(G) is at least 


r(G) E 
(dim G)/2 — (dimG — r(G))/2 


Equality could hold only if dim M; = (dim G)/2 for all i. In that case, every continuous, injective 
function from Mı x Mə to G must be open by invariance of domain, and therefore has an open 
image. In particular, for m} € Mı and m4 € Mə consider the map sending (m1,m2) € Mı x M2 
to mim mem. Its image contains a neighborhood of 1, but by the triple product property it 
intersects the quotient set Q(M3) = M3M;' only at 1, which is impossible since Q(M3) contains a 
submanifold M3m3' (for fixed m3 € M3) of dimension (dim G)/2 that contains 1. 


2. 


Note that in the notation of Definition 4.1, if dim Mı + dim Mz + dim M3 < dim G, then they 
cannot prove any better upper bound for w(G) than 3. This conclusion is immediate if r(G) = dim G, 
and one can check that it follows from r(G) < dimG. In fact, the best upper bound we know on 
the Lie exponent is 3, which holds for abelian groups: 


Proposition 4.4. If G is abelian, then w(G) = 3. 


Proof. Let Hı = G and Hə = H3 = {1}. Then Hı, Hə, and H3 satisfy the triple product property 
in G. Since r(G) = dim G and dim H; + dim Hə + dim H3 = dimG, it follows that w(G) < 3. 

For the other direction, note that if G is abelian, then the product map Mı x Mə x M3 >G 
must be injective or else the triple product property fails. If the map is injective, then dim Mı + 
dim Mə + dim M3 < dim G and hence w(G) > 3. 


There is also a Lie analogue of the packing bound from Definition 2.3. 
Definition 4.5. We say a sequence G1, G2,... of Lie groups meets the packing bound if there exist 
submanifolds Mj), Mo;, and M3, of G; satisfying the triple product property such that 
lim dim G; 
i>oo (dim M1; + dim M2, + dim M2,)/3 


= 2. 


Proposition 4.6. If Lie groups G1, G2,... have limj_+. w(G;) = 2, then they achieve the packing 
bound. 


Proof. It suffices to show that for Mı, M2, and M3 satisfying the triple product property in a group 
G with (dim Mı + dim Mə + dim M3)/3 > (dim G — r(G))/2, 
r(G) = dimG 
(dim Mı + dim M2 + dim M3)/3 — (dim G — r(G))/2 ~ (dim Mı + dim Mə + dim M3)/3° 
This assertion follows from the inequality (dim Mı + dim Mə + dim M3)/3 < (dimG)/2 used in the 
proof of Proposition 4.3. 


It was shown in [CU03, Theorem 6.1] that the Lie groups SL(n,R) meet the packing bound, by 
taking Mı, M2, and M3 to be the groups of upper unitriangular, lower unitriangular, and orthogonal 
matrices. In this construction, the group is an algebraic group over R, as are the subgroups M;. 
In particular, they are all linear algebraic groups over R, i.e., subgroups of GL(n, R) defined by 
polynomial equations. Algebraic groups are a little more general than linear algebraic groups; they 
are to algebraic varieties as Lie groups are to manifolds. 

However, algebraic varieties over C (or any algebraically closed field) cannot help: 


Theorem 4.7. Let G be an algebraic group over an algebraically closed field, and let Vi, V2, and 
V3 be subvarieties of G that satisfy the triple product property. Then 


dim V; + dim Vz + dim V3 < dim G. 


As a consequence, subvarieties of algebraic groups over C cannot be used to meet the packing 
bound or obtain a better Lie exponent than 3. 


Proof. Let vi be any element of V;, and define y: Vi x V2 x V3 > G by 


y(v1, v2, v3) = vv wavy wog t. 
By the triple product property, the only solution of y(v1,v2,v3) = 1 is (vi, v3, v4), and so the 
solution set is zero-dimensional since we are working over an algebraically closed field. However, the 
fiber dimension theorem [Har92, Theorem 17.24] says the dimension of the solution set must be at 
least dim V; + dim V2 + dim V3 — dim G, which yields the desired inequality. 


The intuitive difference between R and C here is that a variety can have fewer points over R 
than one might expect by counting degrees of freedom. For example, the equation x? + --- +22 =0 
defines an (n — 1)-dimensional variety, which has plenty of points over C, but over R it consists of 
just a single point. Fields that are not algebraically closed may lead to an anomalously low number 
of solutions, and we cannot conclude that a variety is zero-dimensional just because it has only 
one real point. What Theorem 4.7 indicates is that to obtain strong examples, we must either use 
constructions that are not defined by polynomial equations, or choose equations that have fewer 
solutions over R than they do over C. (Note that there are many possibilities that are not defined 
by polynomial equations. For example, constructions that use complex conjugation generally do not 
define complex subvarieties. ) 

Theorem 4.7 does rule out one superficially attractive possibility, namely obtaining Lie exponent 2 
via subvarieties of algebraic groups over F, and then transitioning to finite groups by using finite 
subfields of F, with sizes tending to infinity. 

We now give several new constructions over R with parameters that improve upon the previously 
known construction from [CU03]. Our constructions make use of the following observation, which 
relaxes the triple product property for subgroups. 


Lemma 4.8. Suppose that Hı, Hə, and H3 are Lie subgroups of a Lie group G and K is a compact 
subgroup of G such that the equation hyhgh3 = 1 with h; € H; implies that hi, h2, h3 E K. Then the 
Lie exponent of G is at most 


r(G) 
(dim Hı + dim Hə + dim H3 — 2 dim K)/3 — (dim G — r(G))/2° 


We will refer to this situation as the K-triple product property. More generally, the same holds 
for submanifolds M; such that qiqoqg3 = 1 with q; E€ Q(M;) implies q1,q2,q3 E€ K, but we will need 
it only for subgroups. 
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Proof. By the slice theorem [Aud04, Theorem 1.2.1], there exist submanifolds H! of H; such that 
dim H} = dim H; — dim K and no two elements h, h’ € H! satisfy hh’! € K unless h = h’. Then 
the submanifolds H,, H5, and H of G satisfy the triple property property and yield the asserted 
bound. (Note that the first submanifold is H1, not Hj.) 


4.1 Asymptotic Lie exponent 3 


As mentioned above and shown in [CU03, Theorem 6.1], the upper unitriangular, lower unitriangular, 
and special orthogonal groups satisfy the triple product property in SL(n, R). Since these subgroups 
have dimension n(n — 1)/2 inside a group of dimension n? — 1, the groups SL(n, R) meet the packing 
bound. 

However, the rank of SL(n,R) is n — 1, and so this example does not yield any Lie exponent 
bound (the denominator in Definition 4.1 would vanish). In this section we modify the construction 
to get a bound on the Lie exponent approaching 3 for powers of SL(n, R). 


Theorem 4.9. Form > 1 andn>1, the Lie exponent of SL(n,R)™ is at most 


3m(n — 1) 
m(n—1)-—n 


In the proof we will denote the 7,7 entry of a matrix M by Mi j. To avoid ambiguity we will use 
superscripts to index sequences of matrices, with parentheses around the superscripts to distinguish 
them from exponents. Recall also that a unitriangular matrix is a triangular matrix with diagonal 
entries equal to 1. 


Proof. Let Hı = SO(n, R)”, let 
Hy = {(AQ), a Ay) E SL(n, R)” : each A is upper unitriangular}, 
and let 
H; = fo, ...,B™) € SL(n,R)™ : each B® is lower triangular and 12"? = 1 for all i} : 
i=1 


These subgroups have dimensions mn(n—1)/2, mn(n—1)/2, and m(n(n+1)/2—1)—n, respectively. 
We claim that they satisfy the K-triple product property in SL(n.R)™, where 


K ={(C®,...,0™) € SL(n,R)™ : each C® is diagonal with +1 entries}. 


Since dim SL(n, R)” = m(n? — 1) and the rank of SL(n,R)” is m(n — 1), while dim K = 0, the 
claimed bound on the Lie exponent will follow by Lemma 4.8. 

Let M = (M®,..., M) € Hı, A= (AM,...,A(™) € Hy, B= (BY,..., BOM) € Hy. We 
will show that if MA = B, then for each i, the matrix M is diagonal with +1 diagonal entries, in 
which case the same follows for A® and B®). We will prove by induction on j that M Me, = te; 
for all i, where e1,...,e, are the standard basis vectors. 

Let the jth column of A be AY For any i, AM = e1, and so MMe, = B®. Since MČ) is 


orthogonal, Me, and thus also B® must be unit vectors. In particular, B$] < 1 for all i. But 


since [[;4 Bi = 1 this forces BY = +1, which proves the claim for j = 1. 
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Now suppose M%e; = te; for all j < k and all i. For any i, Ae = ek + D j<k Afe; and 
(G) (à) 
B} = 


Bi pek +} jok BY ey. From the induction hypothesis we deduce that 


M%e, = MM (Ae — yA zej) 


j<k 
ek + 5 B Ce — S ALL (e). 
j>k j<k 
Since M is orthogonal, M® eg is a unit vector. Because I], a= = 1, it follows as above that 
BO = = +1 for all i. Hence Me; = +e, for all i, which proves the claim. 


4.2 Conjugates of rotation groups 


Theorem 4.10. There are three conjugates of O(n,R) inside of GL(n,R) satisfying the K-triple 
product property, where K is the subgroup of diagonal matrices with +1 entries on the diagonal. 


This construction meets the packing bound. In particular, it evades the normalizer barrier since 
the normalizer of O(n,R) in GL(n, R) is R* - O(n, R). However, it does not prove a bound for 
w(GL(n, R)), because each of these subgroups has dimension n(n—1)/2, which equals (dim GL(n, R)— 
n)/2, and r(GL(n,R)) = n. Note that the center of GL(n, R) plays no role, and we could just as 
well have stated the theorem for conjugates of SO(n, R) inside SL(n, R). We use the slightly more 
general formulation since it is convenient to allow determinant —1 in the proof by induction. 


Proof. Let G = GL(n,R) and H = O(n,R). To specify the conjugates of H, we take Dı = 
diag(z1,..., £n) with z1 > £2 >-+- > £n > 0 and Də = diag(y1,..., Yn) with O < y1 < y2 <- < 
Yn. Then we will show that H, Hy := D,HD;', and Hə := DH D3' satisfy the K-triple product 
property in G, where K is the group of diagonal +1 matrices. 

In particular, we will show that for every hı € Hı and hg € Hə, if hi hı = hj ha, then 
hy, ho E€ K. The conclusion then follows, since if Ahyhs" =I with h € H, then hyhy 1 € H, meaning 
(hyhz")' (hihg') = I, and hence hi hy = hg hə. 

Suppose that hı = DıMı D7! and hy = DM2D3*, where Mı, Mz € H, and consider 


hi hy = (Dy MY D,)(Di1M, Dz"). 
If (a1, @2,...,@n) is the first column of M1, then the first column of DıMı D7! is 
(a1, ToT] Aa, Jor „n£ an) 
as is the first row of DI 1M7 Dı, so the upper-left entry of their product h] hı is 
a? + (£2/x1) a2 + (z3/x1) a3 +++» + (an/a1)*a2. 
Now, since a? + a3 + --. +a? = 1, we can substitute for a? to obtain 
1+ ((@2/a1)? — 1)ad + ((w3/a1)? — 1)a3 +++ + ((@n/a1)? — lan 


Because z; > xı for all i > 1, this quantity is at most 1, with equality exactly when a3 = 
az =--- =a? = 0. By an identical argument, if (b1, b2,..., bn) is the first column of M3, then the 
upper-left entry of hd ha is 


1+ ((y2/y1)? — 1)b5 + ((ys/y1)? — 1)b3 +--+ + ((yn/yr)* — 1)b;, 
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Table 4.1: Parameters for the real, complex, and quaternionic versions of Theorem 4.10. 


(skew) field R C H 

dim G n? 2n? 4n? 

r(G) n 2n An 

dim H n(n—1)/2 n? n(2n+1) 
dim K 0 n 3n 

Meets packing bound as n + œ yes yes yes 

Lie exponent upper bound oe) 6 4 

which is at least 1, with equality exactly when be be ib = 0. So if hi hy = hd ha, then in 


particular their upper-left entries are equal, and we conclude that a? = b? = 1, while a; = b; = 0 for 
all ¿i > 1. 
Now hı has the form 


where h} is an element of D| O(n — 1,R)D{-' with Di, = diag(x2,..., £n), and hy has the form 


+1|0 ... 0 


ODO s 


where h} is an element of D} O(n —1,R)D5-* with D} = diag(y2,...yn). Finally, since hıhz'h = 1 
we find h also has the same block-diagonal form, and so h}(h,)~! € O(n — 1,R), and then we are 
done by induction on n. 


Remark 4.11. This theorem holds when we replace G with GL(n, C) (resp., GL(n, H)), H with 
U(n,C) (resp., Sp(n)), and K with the group of diagonal matrices with unit complex (resp., quater- 
nionic) numbers on the diagonal. This follows from a similar argument, where one replaces 
transpose with conjugate transpose and uses the positivity of the complex/quaternionic norm. The 
corresponding dimensions are shown in Table 4.1. 


5 Open problems 


The most important challenge highlighted by this paper is to find a construction proving that the 
Lie exponent of a family of Lie groups approaches 2, or to prove that such a construction cannot 
exist. Many questions can be asked along the way, including whether there is a Lie group with Lie 
exponent less than 3, and whether the Lie exponent of SL(n,R) is even finite. 

It follows from [Saw18] that triple product property constructions inside SL(2, q)™ cannot give 
w = 2 for fixed q and growing m, and Theorem 3.2 shows that SL(n,q)™ cannot give w = 2 for 
fixed m and growing q. The proofs of these two facts are quite different: one uses the polynomial 
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method, and the other is Fourier analytic. Is there a common generalization of these two facts that 
would rule out obtaining w = 2 with m and q both growing? 

Together with the fact that abelian groups cannot yield exponent less than 3, Theorem 3.2 
implies that the alternating groups are the only simple groups left that could yield w = 2 via a triple 
product property construction. The representation-theoretic argument fails in this case, since A, 
has an irreducible representation of dimension n — 1 but |A,,| = n!/2. Can an alternate argument 
rule out these groups? 


Acknowledgments 


This work was supported by an AIM SQuaRE grant. J. B. was supported by NSF grant DMS- 
1855784, J. A. G. was partially supported by NSF CAREER award CISE-2047756, and C. U. was 
supported by NSF grant CCF-1815607 and a Simons Foundation Investigator award. 


References 


[Alm21] Josh Alman, Limits on the universal method for matrix multiplication, Theory Comput. 
17 (2021), Paper No. 1, 30 pp. doi:10.4086/toc.2021.v017a001 


[AW 18] Josh Alman and Virginia Vassilevska Williams, Limits on all known (and some un- 
known) approaches to matriz multiplication, Proceedings of the 59th Annual IEEE 
Symposium on Foundations of Computer Science (FOCS 2018), IEEE, 2018, pp. 580-591. 
doi:10.1109/FOCS.2018.00061 


[AW21] Josh Alman and Virginia Vassilevska Williams, A refined laser method and faster matrix 
multiplication, Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms, 
(SODA 2021), SIAM, 2021, pp. 522-539. doi:10.1137/1.9781611976465.32 


[AFLG15] Andris Ambainis, Yuval Filmus, and François Le Gall, Fast matrix multiplication: 
limitations of the Coppersmith-Winograd method, Proceedings of the 2015 ACM 
Symposium on Theory of Computing (STOC 2015), ACM, 2015, pp. 585-593. 
doi:10.1145/2746539.2746554 


[Aud04] Michele Audin, Torus actions on symplectic manifolds, revised ed., Progress in Mathe- 
matics, vol. 93, Birkhauser Verlag, Basel, 2004. doi:10.1007/978-3-0348-7960-6 


[BCC*17a] Jonah Blasiak, Thomas Church, Henry Cohn, Joshua A. Grochow, Eric Naslund, 
William F. Sawin, and Chris Umans, On cap sets and the group-theoretic approach to 
matrix multiplication, Discrete Anal. (2017), Paper No. 3, 1-27. doi:10.19086/da.1245 


[BCC*17b] Jonah Blasiak, Thomas Church, Henry Cohn, Joshua A. Grochow, and Chris Umans, 
Which groups are amenable to proving exponent two for matrix multiplication?, preprint 
(2017), arXiv:1712.02302. 


[Bre14] Emmanuel Breuillard, A brief introduction to approximate groups, Thin groups and 
superstrong approximation, Math. Sci. Res. Inst. Publ., vol. 61, Cambridge Univ. 
Press, Cambridge, 2014, pp. 23-50. http: //library.msri.org/books/Book61/files/ 
15breu. pdf 


14 


[CVZ21] 


[CKSU05] 


[CU03] 


[FG12] 


Gow08] 


Har92] 


LS74] 


[LMT 13] 


[MT 11] 


[Saw18] 


Matthias Christandl, Péter Vrana, and Jeroen Zuiddam, Barriers for fast matrix 
multiplication from irreversibility, Theory. Comput. 17 (2021), Paper No. 2, 32 pp. 
doi:10.4086/toc.2021.v017a002 


Henry Cohn, Robert Kleinberg, Balázs Szegedy, and Christopher Umans, Group- 
theoretic algorithms for matrix multiplication, Proceedings of the 46th Annual Sym- 
posium on Foundations of Computer Science (FOCS 2005), IEEE Computer Society, 
2005, pp. 379-388. doi:10.1109/SFCS.2005.39 


Henry Cohn and Christopher Umans, A group-theoretic approach to fast ma- 
triz multiplication, Proceedings of the 44th Annual Symposium on Foundations 
of Computer Science (FOCS 2003), IEEE Computer Society, 2003, pp. 438-449. 
doi:10.1109/SFCS.2003.1238217 


Jason Fulman and Robert Guralnick, Bounds on the number and sizes of conjugacy 
classes in finite Chevalley groups with applications to derangements, Trans. Amer. Math. 
Soc. 364 (2012), no. 6, 3023-3070. doi:10.1090/S0002-9947-2012-05427-4 


W. T. Gowers, Quasirandom groups, Combin. Probab. Comput. 17 (2008), no. 3, 
363-387. doi:10.1017/S0963548307008826 


Joe Harris, Algebraic geometry: a first course, Graduate Texts in Mathematics, vol. 
133, Springer-Verlag, New York, 1992. doi:10.1007/978-1-4757-2189-8 


Vicente Landazuri and Gary M. Seitz, On the minimal degrees of projective representa- 
tions of the finite Chevalley groups, J. Algebra 32 (1974), 418-443. doi:10.1016/0021- 
8693(74)90150-1 


Michael Larsen, Gunter Malle, and Pham Huu Tiep, The largest irreducible repre- 
sentations of simple groups, Proc. Lond. Math. Soc. (3) 106 (2013), no. 1, 65-96. 
doi:10.1112/plms/pds030 


Gunter Malle and Donna Testerman, Linear algebraic groups and finite groups of Lie 
type, Cambridge Studies in Advanced Mathematics, vol. 133, Cambridge University 
Press, Cambridge, 2011. doi:10.1017/CBO9780511994777 


Will Sawin, Bounds for matchings in nonabelian groups, Electron. J. Combin. 25 (2018), 
no. 4, Paper No. 4.23, 1-21. doi:10.37236/7520 


15 


